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SMALL NOISE ASYMPTOTIC OF THE TIMING JITTER IN 
SOLITON TRANSMISSION 

By Arnaud Debussche and Eric Gautier 

ENS Cachan Bretagne 

We consider the problem of the error in soliton transmission 
in long-haul optical fibers caused by the spontaneous emission of 
noise inherent to amplification. We study two types of noises driving 
the stochastic focusing cubic one dimensional nonlinear Schrodinger 
equation which appears in physics in that context. We focus on the 
fluctuations of the mass and arrival time or timing jitter. We give 
the small noise asymptotic of the tails of these two quantities for the 
two types of noises. We are then able to prove several results from 
physics among which the Gordon-Haus effect which states that the 
fluctuation of the arrival time is a much more limiting factor than 
the fluctuation of the mass. The physical results had been obtained 
with arguments difficult to fully justify mathematically. 

1. Introduction. The nonlinear Schrodinger (NLS) equation occurs as 
a generic model in many areas of physics and describes the propagation 
of slowly varying envelopes of a wave packet in media with both weakly 
nonlinear and dispersive responses; see [35] for a detailed presentation. The 
one-dimensional equation with a cubic focusing nonlinearity, for example, 
has the form 

r)u u ° 

(1.1) i^^ = Au Uo + \u Uo \ 2 u U0 , 

where u u ° is a complex valued function depending on t > and iGl and 
the superscript no means that u u °(0,x) = uq(x),x £ R. This equation is a 
very accurate model in the context of single-mode optical fibers over short 
distances. A derivation of the equation in that context is given, for example, 
in [24]. Resulting from a balance between the focusing nonlinearity and the 
dispersive linear part, localized stationary waves propagate. They are called 
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solitons and have the form \^2Asech(A(x — xq)) exp(— iA 2 t + i9q), where 
A > is the amplitude, xq and 9q are respectively the initial position and 
phase. By extension, we herein also call solitons the following nonstationary 
progressive solutions: 

(1.2) V2AsecYi{A{x - x ) + 2AVt) exp(-i(A 2 - V 2 )t + iV(x - x ) + i9 ), 

where V is the group velocity or angular carrier frequency. 

In soliton based amplitude-shifted-keyed (ASK) communication systems, 
solitons are used as information carriers to transmit the datum or 1. A 1 
corresponds to the emission of a soliton at time with zero velocity ^^(x) = 
\/2Asech(Ax). It is produced by a laser beam. At coordinate T (end of the 
line) a receiver records 

r l/2 

(1/Z) / \u U0 (T,x)\ 2 dx, u = or u = V° A . 

J -1/2 

In optics the usual x variable of the NLS equation denotes some retarded 
time while t is space. Thus, [—1/2,1/2] is a window in time and I may be 
chosen small since the solution u ll ° of the NLS equation is localized and 
remains centered. When the above quantity is above a threshold, it is decided 
that a 1 has been emitted, otherwise it is decided that a has been emitted. 

Over long distances, damping induced low loss becomes significant and 
the signal has to be amplified. However, due to quantum considerations, am- 
plification is intrinsically associated with small noise; see [15] for a physical 
justification of noise in optics. Simply stated, due to the Heisenberg princi- 
ple, there is inherently uncertainty on the amplified signal. This uncertainty 
is accounted by noise in the system. This phenomenon is called spontaneous 
emission of noise. These intrinsic quantum features can have direct macro- 
scopic consequences such as fluctuation of the arrival time, also called timing 
jitter or diffusion of the soliton. A practical consequence for engineering is 
error in soliton transmission. 

We consider in this article two specific models from the physics literature 
on the topic which are stochastic PDEs (SPDEs). A first type of amplifi- 
cation, and most discussed, is the case of regularly spaced Erbium-Doped 
amplifiers placed along the line and such that the distance between am- 
plifiers is small compared to the length of the line. The limit case where 
there is a continuum of amplifiers is called distributed amplification. In that 
case the noise (Gordon-Haus noise) acts as a random external force; see, for 
example, [14, 19, 31]. There, the following equation is used: 

(1.3) *^r~ = Au£,uo + ' u£, "°' + 

where e stands for the small noise amplitude, T is a complex Gaussian space- 
time white noise and uq is again the initial datum. 
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A formal derivation of this model is proposed in the above references. 
In particular, it is argued that it can be assumed that the damping term 
is exactly balanced by the amplifiers so that these effects do not appear 
directly in the model. Only the noise remains. Note also that this equation 
also appears in the context of anharmonic atomic chains in the presence of 
thermal fluctuation; see, for example, [5] where timing jitter is also studied. 
In that second case the derivation can probably be done in a more rigorous 
way; this will be the object of future work. 

If we consider the recently studied Raman coupling to the thermal phonon 
(see [7, 15, 16, 17, 29]), or four-wave-mixing (see [15, 30]), another quan- 
tum noise appears and depends on the pulse intensity. It is modeled as a real 
multiplicative noise. The first physical derivation of the equation is obtained 
in [16]. Note that in the case of the Raman amplification, an extra Raman 
nonlinear response appears in the equation. As in the above physical refer- 
ences, we drop it since it is assumed to have a limited effect on the noise 
induced timing jitter. The following model is used: 

(1.4) i—^- = Au £ ' u ° + \u £ ' u ° \ 2 u £ > U0 + v^'Tfl. 

Here the noise Tr is a real Gaussian noise and the product is a Stratonovich 
product. An important feature of this type of noise is that the mass, given 
mathematically by the square of the L 2 norm, is a conserved quantity. The 
stochastic NLS equation with real multiplicative noise is also used in the 
context of crystals; see, for example, [2, 3]. 

Unlike the deterministic case, an initial soliton profile is progressively 
distorted due to noise. As a consequence, with a probability that is expected 
to be small, an error in transmission occurs. It is an important issue to derive 
theoretical tools to estimate this probability. A first type error occurs when 
a soliton is emitted and at the other end of the fiber it is not detected. Two 
phenomena may induce such error. The mass 

N(4>) = H\\h W> e l 2 , 

which is an invariant quantity without noise fluctuates when an additive 
noise is taken into account. Thus, when the noise is additive, the signal may 
not be detected due to a decrease of the mass. The second source of error is 
the so-called timing jitter. The arrival time is defined as 

Y (<(>) = [ x\cf)(x)\ 2 dx V</>EL 2 . 
Jr 

Without noise, the signal is centered at time x = and the arrival time 
is zero. The noise may change the arrival time and shift the signal outside 
the measuring window [—1/2,1/2]. From these considerations, the problem is 
reduced to estimate the probability that the mass has decreased significantly 
or that the arrival time has changed significantly. 
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Similarly, when no signal is emitted, an additive noise may create from 
nothing a signal with high enough mass at T and that might be mistaken 
as a 1. When the noise is multiplicative, because the mass is invariant, we 
only have to consider the loss of a 1 due to timing jitter. 

The aim of the paper is to apply probabilistic tools, more specifically, 
large deviations estimates to evaluate theoretically the probability of large 
fluctuations of the mass and arrival time. The large fluctuations events are 
indexed by R positive, for example, large fluctuations of the arrival time 
correspond to {Y(u £ > u (T)) > R} or {Y(u £ ' u °(T)) < -R}. Using large de- 
viation techniques is justified by the standard assumption in the physics 
literature that the noise is small. We prove that, as usual in that context, 
the large deviation probabilities are related to an optimal control problem. 
They are deduced by contraction from a large deviation principle at the 
level of the paths. Our aim is to give precise upper and lower bounds of 
these large deviation probabilities. 

We get lower bounds by minimizing the rate function over a small set 
of paths. Namely, we take paths which are modulated solitons, that is, soli- 
tons with time varying parameters. Upper bounds are obtained using energy 
inequalities. 

In the physics literature a different method is used. It relies on an adi- 
abatic perturbation theory, (see, e.g., [25, 26]), where the pulse is approx- 
imated by a soliton ansatz with finite fluctuating collective variables. In 
other words, the stochastic NLS equation is replaced by a finite number of 
coupled stochastic differential equations for the soliton parameters. Thus, 
the original infinite dimensional problem is reduced to a finite dimensional 
one for which powerful methods can be used. 

It seems very difficult to justify theoretically this method. Our argument 
is rigorous. Soliton ansatz are also used, but only to provide lower bounds. 
Surprisingly, the upper bound which is obtained in a totally different way 
is of the same order as the lower bound with respect to different physically 
relevant parameters: the length of the fiber T, the initial amplitude of the 
signal A and the parameter R indexing the large fluctuation event. Moreover, 
our results are comparable to the ones available in the physics literature. 

We recover, for instance, the fact that the law of the mass is not Gaus- 
sian. Concerning the arrival time, the order in R we obtain proves that the 
log of the tails are undistinguishable from the log of Gaussian tails. The 
Gaussianity of the arrival time is a well studied issue in physics. In [24], 
assuming that the timing jitter is the most troublesome process, an upper 
limit of the information rate is derived based on the Gaussian assumption 
and variance computations. In [33] the log of the tails of the amplitude and 
arrival time are evaluated numerically via an importance sampling Monte 
Carlo estimator and using an ansatz approximation. It is obtained that the 
log of tails of the arrival time is the same as the log of a Gaussian tail, while 
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the log of the tail of the amplitude differs significantly from that of Gaussian 
tails. In [1, 14, 28] it is shown that the arrival time can be considered as 
Gaussian in the first order only while in [14, 20, 32, 37] justifications for a 
deviation from Gaussianity when there is filtering or soliton interaction are 
given. 

Assuming that the seemingly Gaussian arrival time is indeed Gaussian, 
we obtain the same order in T as physicists. In [5, 17, 24] the variance of 
the arrival time is studied. In [24] it is proved to be of the order of T 3 
(super diffusion) and the timing jitter is connected to a shift in the soliton 
carrier frequency which we exploit in the construction of ansatz. In [17], 
where both independent complex additive and real multiplicative noises ap- 
pear in the equation, the contribution of each noise to the variance of the 
arrival time is also of the order T 3 . 

We are also able to compare the tails of the arrival time to the tails of 
the mass at the end of the line when the noise is additive. We obtain that 
the tails of the arrival time are thicker than that of the mass. Thus, timing 
jitter is the dominant factor as suggested by Gordon and Haus in [24]. 

Let us also mention that recent articles [14, 19, 31] give approximate 
PDF of the mass, as well as of the joint law of the mass and arrival time at 
T with initial datum \E'^. Our results compare to theirs in many ways. In 
the first article the PDF is obtained using the Fokker-Planck equation and 
again approximating the pulse by a soliton with finite random modulations 
evolving according to dynamically coupled SDEs. In [19] the PDF is obtained 
via a saddle point approximation of a finite dimensional approximation of the 
infinite dimensional Martin-Siggia-Rose effective action, relying on ansatz. 
Theory for infinite dimensional effective action is developed in [27], but it 
has not been used so far for the problem at hand. These infinite dimensional 
effective actions in physics are intimately related to the rate function of a 
sample path large deviation principle (LDP). Paths minimizing the action 
are then called optimal fluctuations or instantons generalizing the quantum 
mechanics instantons studied in [21] using large deviation techniques. 

With our large deviations approach, we study the tails of the CDF and 
not the bulk of the distribution as with PDFs. The bulk seems less interest- 
ing for a study of the rare events causing error in transmission. We obtain 
accurate rigorous results without using directly the spectral properties of the 
nonlinear Schrodinger operator. Though applied here to the problem of the 
error in transmission and for specific and simplified equations, this approach 
could be used for more general models. Its application to the exit time off 
neighborhoods of the soliton or randomly modulated soliton for stochastic 
Korteweg-de Vries equations will be given elsewhere. 

2. Notation and preliminaries. For p > 1, L p is the classical Lebesgue 
space of complex valued functions on R and W 1 '^ is the associated Sobolev 
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space of L p functions with first order derivatives, in the sense of distributions, 
in L p . If / is an interval of R, (E, \\ ■ \\e) a Banach space and r belongs to 
[1, oo], then L r (/; E) is the space of strongly Lebesgue measurable functions 
/ from I into E (see [18]) such that t -> is in 1/(1). The space L 2 

with the inner product defined by (u,v)^2 = IHe f^u(x)v(x)dx is a Hilbert 
space. The Sobolev spaces H s are the Hilbert spaces of functions of L 2 with 
partial derivatives up to order s in L 2 . When s is fractional it is defined 
classically via the Fourier transform. When the functions are real valued we 
specify it, for example, we write H S (]R,]R). The following Hilbert spaces of 
spatially localized functions 

E = {/£H 1 :iKi/(i)eL 2 }, 

S 1 / 2 = {/£H 1 :ih y/\x\f(x) € L 2 } 
are also introduced and endowed with the norms 

\\f\\l = \\ffm + \\x^xf(x)\\l 2 , 

ll/lllx/^ll/llli + lk^V^/^HL- 

We denote by ||^||£ c (a,b) the norm of $ as a linear continuous operator 
from A to B, where A and B are normed vector spaces. We recall that is a 
Hilbert-Schmidt operator from H to H, where H and H are Hilbert spaces, 
if it is a linear continuous operator such that, given a complete orthonormal 
system (ef )f =1 oiH, ~£jLi \\$ef ||| < oo. We denote by £ 2 (H,H) the space 
of Hilbert-Schmidt operators from H to H endowed with the norm 

oo 

PIU(^)=tr(^*) = Ell $e fll|- 

j'=i 

We also recall that a cylindrical Wiener process W c in a Hilbert space 
H is such that, for any complete orthonormal system {ej)JL l of H, there 
exists a sequence of independent Brownian motions (/3 J -)^ =1 such that W c = 
J2j^=i Pj e j- This sum does not converge in H 1 but in any Hilbert space U such 
that the embedding H C U is Hilbert-Schmidt. The image of the process 
W c by a linear mapping $ on H is a well defined process in H when the 
mapping is Hilbert-Schmidt on H, that is, $ £ C2{H) = C2(H,H). Then, 
W = 3>Wc is such that W(l) is a well defined Gaussian random variable with 
covariance operator A detailed presentation of Hilbert space valued 

Wiener processes, the stochastic integration in that setting and SPDEs is 
given, for instance, in [8], Chapter 4. 

We recall that a rate function / is a lower semicontinuous function and 
that a good rate function / is a rate function such that, for every positive 
c, {x : I{x) < c} is a compact set. 
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Let us now recall some mathematical aspects of the stochastic NLS equa- 
tions. The equations, written as SPDEs in the Ito form, are in the additive 
case 

(2.1) idu £ > u ° - (Au £ ' U0 + \u £ ' u °\ 2 u £ ' u °) dt = ^dW, 
and in the multiplicative case 

(2.2) idu £ ' U0 - (Au £ > u ° + \u £ ' Uo \ 2 u £ ' Uo ) dt = ^u £ ' u ° o dW. 

The symbol o stands for the Stratonovich product. It is convenient to use 
the Ito product so that we write the equivalent Ito form of the equation 

(2.3) idu £ ' uo - (Au £ ' uo + \u £ > U0 \ 2 u £ > U0 )dt = ^u £ ' uo dW - l -eF^u £ ' u \ 

where, given (ej)JL l an orthonormal basis of L 2 , F$(x) =J2'jLi(^ e j) 2 ( x )- 
The term (e/2)F$(x) is the Ito correction necessary to transform the 
Stratonovich product into a Ito one. Note that F$ does not depend on the 
basis. 

As mentioned earlier, in the case of equation (2.3) (see [10]), the mass 

N(u £ ' U0 (t)) = \\u £ > U0 {t)\\l 2 , t>0, 

is a conserved quantity. Precise assumptions on $ such that W = &W C are 
made below. These equations are supplemented with an initial datum 

u £ ' Uo (0) = u . 

In this paper we consider initial data in S C H 1 and work with the solution 
constructed in [10] . Since we work with a subcritical nonlinearity, we could 
also consider solutions in L 2 with initial data in L 2 . However, the H 1 -setting 
is preferred in order to be able to consider the spaces E and E 1 / 2 and study 
the arrival time of the pulse 

Y(u £ ' uo (t))= f x\u £ ' U0 (t,x)\ 2 dx, t>0, 

defined when u £ ' u °(t) belongs to E 1 / 2 . 

We are concerned by weak solutions or, equivalently, by mild solutions 
which, in the additive case, satisfy 

u £ ' ua (t) = U(t)u - i [ U(t - s)(\u £ ' Ua (s)\ 2 u £ ' U0 (s)) ds 

(2-4) ( J ° 

-is/e f U{t-s)dW{s), 
Jo 
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where (U(t))t^m stands for the Schrodinger group, U(t) =e~ itA ,teR. The 
last term is called the stochastic convolution. In the multiplicative case, the 
mild equation is 



where the stochastic integral is an Ito integral. 

The noise is the time derivative in the sense of distributions of the Wiener 
process W. It corresponds to a white noise in time. A space-time white noise 
would correspond to <I> equal to the identity. It is, however, the noise mainly 
considered in optics. We cannot handle such rough noises and make the 
assumption that the noises are colored in space in order to obtain well- 
posedness. The basic limitation is that, unlike semi-groups like the Heat 
semi-group, the Schrodinger group is an isometry and does not allow smooth- 
ing in the Sobolev spaces based on L 2 . For instance, in the additive case, 
it can be seen that the stochastic convolution is a well defined process with 
paths in L 2 if and only if $ is a Hilbert-Schmidt operator on L 2 . In that 
case, however, we will, for computational issues, consider sequences of noises 
that mimic the white noise in the limit. This statement will be made more 
precise. 

In fact, we make even stronger assumptions. In the additive case we as- 
sume that W is a Wiener process in S, in other words, we require that 
<I> G C2(L 2 ,T,). In the multiplicative case, it is imposed that W is a Wiener 
process in EP(IR,IR) where s satisfies s > 3/2. It allows to consider paths in 



We know that the Cauchy problem is globally well posed in H 1 ; see [10] for 
a general discussion on the local well posedness and the global existence for 
more general nonlinearities and dimensions. Note that in the deterministic 
case, the NLS equation considered here is integrable thanks to the inverse 
scattering method. We do not use these techniques in this article. Results on 
the influence of the noise on the blow-up time for more general nonlinearities 
and dimensions are given in [11, 12]. In [4, 13] the ideal white noise and 
results on the influence of a noise on the blow-up are studied numerically. 

Sample path LDPs for stochastic NLS equations are proved in [22, 23]. 
These LDPs are stated in the topology of C([0,T];H ) for T > and do not 
allow to treat the arrival time of the solution. We shall generalize these and 



give LDPs in C([0, T]; £ 1/2 ). The rate function of the LDP in the additive 




(2.5) 





SMALL NOISE ASYMPTOTIC OF THE TIMING JITTER 



9 



case is defined in terms of the mild solution of the control problem 

(2.6) U^ = Au + \u\ 2 u + ^h, 

\u(0) = u £T, and h £ L 2 (0,T;L 2 ). 

We denote the solution by u = S a,u ° (h) . The mapping h — ► S a,u ° (h) is called 
the control map and (2.6) the control equation. 
In the multiplicative case, the control equation is 

_n du ,n 

(2.7) i— = Au + \ufu + u®h, 

whose mild solution is denoted by u = S m,u ° (h). The mapping S m,u ° is again 
the control map and (2.7) the control equation. 

In this article, when describing properties which hold both in the additive 
and multiplicative cases, we use the symbol S(uq, h) to denote either S a ' u °(h) 
or S m ' U0 (h). 

Let us now state the sample path LDPs. As already mentioned, these 
are slight generalizations of the LDPs given in [22, 23]. For the reader's 
convenience, we give the proof for the case of an additive noise in Section 5. 
The case of a multiplicative noise is more involved, but does not present new 
difficulties compared to the proof given in [23] . In order to keep the length of 
the article reasonable, we only give the new ingredients necessary to adapt 
the proof. 

Theorem 2.1. Assume that <£> belongs to £2(L 2 ,S) in the additive case 
and <1> E £2(L 2 , H S (IR, R)) with s > 3/2 in the multiplicative case. Assume 
also that the initial datum uq is in S. Then the solutions of the stochas- 
tic nonlinear Schrddinger equations (2.4) and (2.5) are almost surely in 
C([0,T]; X 1 / 2 ). Moreover, they define C([0, T]; X 1 / 2 ) random variables and 
their laws (//" ' °) £ >o satisfy a LDP of speed e and good rate function 

I U0 {w) = \ inf 11 ^" 2 



where S(uo,-) = S a ' u °(-) in the additive case and S(uq,-) = S m,u °(-) in the 
multiplicative case, and with the convention that inf = oo. It means that, 
for every Borel set B of C([0, T]; T}/ 2 ), we have the lower bound 



- inf r°(w) < lime log P(u e '"° G B) 

_ ° 

and the upper bound 



lime log ¥(u £ ' uo e B) < - \ni_I u °(w). 
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These sample path LDPs allow, for example, to evaluate the probability 
that, originated from a soliton profile 

V A (x) = V2Asech(Ax), 

the random solution be significantly different from the deterministic soliton 
solution 

V A (t,x) = ^° A {x)exp(-iA 2 t). 
Indeed, for 5 and rj positive and e small enough, the LDP implies that 
eM-Ci/e) < F(\\u^a - * a\\cqp,t\-zV*) > S ) ^ exp(-C 2 /e), 

where 

Ci= inf I*A(w)+ri 

«;:|h--5A|l c{[0iT];E i/2 ) ><5 



and 



C 2 = inf 7*a(u,) 

^ : I^-*a|| C([0iT] . e i/2 ) >'5 



Recall that, since the rate function is a good rate function, if B is a closed set 
and inf w ^b I^ a {w) < oo, then there is an / in B, optimal fluctuation, such 
that I^ A (f) = inf W £B I^ A (w). Thus, if B does not contain the deterministic 
solution, then necessarily inf^ e ^ I^ A (w) > 0. Consequently, r\ may be chosen 
such that C2 is positive and the above probability of a deviation from the 
deterministic path is exponentially small in the small e limit. 

In this article we are interested in estimating the probability of particular 
deviations from the deterministic paths. Namely, we wish to study how the 
mass and the arrival time of a solution at coordinate T deviate from their 
value in the "frozen" deterministic system (i.e., when e = 0). In the absence 
of noise, the mass is a conserved quantity and for initial data being either 
or the arrival time remains equal to zero. 

We know from [22] that we may use the contraction principle to deduce 
from LDP for the paths a LDP for the mass at T and obtain a LDP with 
speed e and good rate function for an initial datum uq 

I u N °{m) = \ inf {IWlL3foT.L^}- 

In the case of a multiplicative noise, the mass is a conserved quantity. Thus, 
in this case, the mass cannot deviate from the constant value corresponding 
to that of the initial datum. 

Similarly, the mapping Y is continuous from E 1 / 2 into M. We may thus 
define by direct image the measures (// Y ( n '"°^ T ^) £ >o f° r an initial datum 
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uo in S. We obtain by contraction that they satisfy a LDP of speed e and 
good rate function 

the control map S is either that of the additive or multiplicative case. 

Let us briefly explain our strategy to estimate the probability of some 
event. Let us consider, for instance, the event D £ = {Y(u e '°(T)) £ [a, b]}, 
where [a, b] is an interval which does not contain 0. We use the LDP to 
obtain 

- inf I$r(y) < rimelogPfJU 

(2.8) 

< lime log P(D 6 ) < - inf I$(y). 

y£{a,b] 

To approximate from above the upper bound, we use energy type inequal- 
ities. These give a lower bound on the minimum L 2 norm of the control h 
required to change the deterministic behavior and have the arrival time in 
[a, b] at time T. Namely, we obtain a positive constant c such that 

HY(S(u ,h)(T)) G [a, b] then ±\\Hho,TV) > c 
This clearly implies 

hmelogP(L) e ) < -c. 

The second step is to find a particular function h such that Y(S(uo,h)(T)) £ 
(a,b) and c= (l/2)||/i|| 2 2 ( T . L2 ) is as small as possible. Then 

-c< lime log P(D E ). 

In this second step, we minimize on a smaller set of controls which gives rise 
to a problem a from the calculus of variations. 

The difficulty is to have sufficiently sharp energy estimates and to find a 
good solution to the control problem so that c and c are as close as possible. 
We see below that we are able to do so in some interesting situations and 
derive good estimates on such probabilities. 

Note that proceeding as in [22] for the mass, we may prove in the addi- 
tive case that inf yg j/y°(y) < oo for every nonempty interval J and any uq 
provided the range of $ is dense. Indeed, for every real number a, a solu- 
tion of the form u(t,x) = (1 + atx)uo satisfies Y(u(T)) = aTn 2 /3. Plugging 
this solution into equation (2.6), we find a control such that the solution 
reaches any interval at time T. Using the continuity of h i— ► Y(S a ' u ° (h)(T)) 
from L 2 (0,T;L 2 ) into R and the density of the range of <!>, we obtain 
inf y g j Jy° (y) < oo. This shows that the lower bound is nontrivial. With ar- 
guments given previously, we know that the upper bound is nontrivial as 
well. 
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Remark 2.2. Using similar arguments as in [22], we can prove that, for 
every positive R besides an at most countable set of points, we can replace 
lim and lim by lim in the LDP and obtain 

lime log F(Y(u e ' Ua (T)) > R) 

= -(l/(2e)) inf {\\h\\h( T-mh 

lime log P(Y(u e,u °(r)) < —R) 

= -(l/(2e)) inf {II^IIlVo™}- 

This uses the fact that a monotone and bounded function is continuous 
almost everywhere. 

We end this section with some remarks which will be useful in the de- 
velopment of our method when we consider the arrival time of the solution. 
Let us consider the case when the initial datum is The probability of 
tail events of the arrival time are related to the behavior of Y(S(^\, h)). 

An equation for the motion of the arrival time is given in [38] in the case of 
an external potential. The first step consists in multiplying the control PDE 
by —ixu, taking the real part, and integrating by part the term involving 
the Laplace operator. We then obtain for the control PDE associated to the 
multiplicative case 

(2.9) ^-Y(S m '^(h)(t)) = 2me(i [ S m ^A (h) (t, x) d x S m '*A (h) (t, x) dx] , 
dt \ m J 

while in the additive case we obtain 

d 



(2.10) 



^-Y{S a ^A(h){t)) = 2<Ri(i [ S a ^A(h){t,x)d x S a ^A(h)(t,x)dx 
dt \ Jr 

-mt\i J xS a ^A(h)(t,x)(<Z>h)(t,x)d x y 

Below, we use the notation 

P(u) = 29teft J u(x)d x u(x) dx^j , ueH 1 . 

As a consequence of (2.9), we see that, in the multiplicative case, the 
arrival time of the solution of the control problem cannot move unless its 
phase depends on the space variable. For instance, if the control is chosen so 
that the solution S(\I/^,/i) is a modulated soliton of type (1.2) with varying 
amplitude and group velocity, 

S(^4, h){t) = V2A(t) sech(A(t){x - x ) + 2A{t)V{t)t) 

x exp(-i(^l(t) 2 - V(tf)t + iV(t)(x - x ) + i0 ), 
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we have the well-known identity 

~Y(S(*«U)(t)) = -2V(t)N(S m ^(h)(t)) = -8V(t)A(t). 

It will be convenient to choose controlled solutions of the form above. Since 
the initial datum is we necessarily have V(0) = 0, hence, V cannot be 
chosen constant, otherwise the arrival time does not change. We will see that 
it is sufficient to have a constant, amplitude A in order to get sharp bounds. 
Thus, we will use modulated solitons as solutions of the control problem 
with constant amplitude when studying the motion of the arrival time. 

The first idea to find a control giving a solution whose arrival time or 
mass verify some desired property is to take the above modulated soliton 
and plug it into the control equation. This gives an explicit form of the 
control in terms of the various parameters. Then, we compute the space- 
time L 2 norm of this control. We obtain a function of the parameters which 
we can try to minimize thanks to the calculus of variations. This approach is 
not easy to perform, the function to minimize has a complicated form and is 
often singular. However, we are not interested in finding the exact extremal 
of the function and we use this method in an heuristic way. This allows us 
to guess good controls with low enough space-time L 2 norm compared to 
the upper bound and such that the controlled path is a modulated soliton 
satisfying the desired constraints. We will see that this method is successful. 

Let us consider the following controlled nonlinear Schrddinger equation: 

(2.11) i*^- = Au + \u\ 2 u + \(t)xu 

with initial datum The function A is taken in L 1 (0,T;R). This corre- 
sponds to the multiplicative control equation with <J>/i = \(t)x or to the addi- 
tive one with <I>/i = \(t)xu. We use well-known transformations to compute 
explicitly the solution of (2.11) which we denote by A- We first may check 
that the functions v\ and v 2 defined by vi(t,x) = exp(i(/ * X(s) ds)x)u(t, x) 
and v 2 (t, x) = exp(— % J (J Q S A(r) dr) 2 ds)v\(t, x) (gauge transform) satisfy the 
PDEs 

dvi d 2 Vl l2 / ft w , , \ 2 / /■* . , \dvx 



and 



dv 
~dt 



'2 x9 /( ( ,, \dv 2 \ d 2 V 2 2 



with initial datum We conclude using the methods of characteristics 
that i>3 defined by 

Vs(t, x) = v 2 \t, x + 2 J J X(u)duds^j 
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is a solution of the usual NLS equation with initial datum Thus, we 
obtain that vs(t,x) = ^?A{t,x) and that the solution of the Cauchy problem 
associated to (2.11) is 

\S> A X (t,x) = V2Asech^A(x - 2 J J A(r) drds^ 
x exp -iA 2 t + i J (J A(r)d-rj ds 

- ix J* \(s) ds + 2i (J A(a) ds^j (j* j* A(r) dr ds^j . 

We obtain a modulated soliton with group velocity given by V{t) = /q A(s) ds. 
In the additive case, it is possible to obtain a control such that the solution 
has the same arrival time and group velocity and such that the space-time 
L 2 norm of the control is simpler to compute. It is obtained thanks to the 
observation that using the gauge transform the solution of the Cauchy prob- 
lem 

r t rs 



A(t) drds \ v, 



(2.12) \ ,d it = *" + ^ V + X{t) { X - 2 L[ 
{ »(0) = <S A , 

is given by 

&A s \(t,x) =exp^2i J A(s) J j A(cr) da dr ds^V A,\(t,x) 



Remark 2.3. For the controls chosen above, relation (2.9) holds also 
in the additive case. Thus, the second term in (2.10) which, at first glance, 
could be useful to act on the arrival time is in fact useless. 

Also, it could be thought that the choice of more complicated group ve- 
locities could be useful. We have tried to consider a space dependent group 
velocity, but the calculus of variations approach indicates that optimality is 
reached when it does not depend on space. 

3. Tails of the mass and arrival time with additive noise. In the case 
of an additive noise, both the mass and arrival time may deviate from the 
deterministic behavior and result in error in transmission. 

We study tails and thus the probability of a deviation from the mean. 
The constant R below quantify this deviation. We are not really interested 
in large R. In the case of the mass, for example, interesting cases are when 
R lies in (0,4). But, since e goes to zero and the factor in the exponential 
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is of the order of 1/e while R is of order 1, it results in very unlikely events. 
These significant excursions of the mass and arrival time are exactly large 
deviation events. 

Moreover, another parameter is particularly interesting. It is T the length 
of the fiber optical line. It is assumed to be large. For example, we could 
think of a fiber optical line between Europe and America. 

We first recall the results obtained in [22] for the tails of mass of the pulse 
at the end of the line. We repeat the proofs for the reader's convenience. 
The aim is to compare these tails with the tails of the arrival time obtained 
thereafter. We show that indeed the timing jitter is the dominant effect in 
the error in transmission when the noise is additive. The initial datum may 
be uq = or uq = ^, where ^(x) = \/2sech(a;). We could consider a soliton 
profile with any amplitude A as well but, for simplicity, we consider the case 
A = 1. However, we consider below the parameter A for the timing jitter 
in order to compare with results from physics. Indeed, this dependance is 
made explicit in the case of the timing jitter in the physics literature. Let 
us begin with upper bounds of the tails. As already mentioned, they are 
obtained thanks to energy estimates. For the second bound, we consider the 
case of the emission of a signal. In that case only a decrease of the mass is 
troublesome and causes error in transmission. Thus, the bound given only 
accounts for a significant decrease of the mass. 

Proposition 3.1. For every positive T and R [R in (0,4) for the second 
inequality] and every operator <3? in /^(L 2 ,!! 1 ), the following inequalities 
hold: 

In^ e logP(N(u e ' (T)) > R) < -R/(8T\Ml c{L2)L2) ), 
melogF(N(u^(T))-4< -R) < -R 2 /(8T\ml c(L ^ L2) (A + R)). 

Proof. Multiplying by — iu the equation 

■ du . , , , 9 

i An - \\uru = ®h, 

dt 

integrating over time and space and taking the real part gives, for t £ [0,T], 

(3.1) |Kt)||L2 - |KH L 2 =29ie(-ij J (((f>h)(s,x)u(s,x))dxdsj 
and by the Cauchy-Schwarz inequality, 

(3.2) |Kt)||L2 - IKIIl2 < 2||$||£ c ( L 2 iL 2 ) ||/l|| i 2( 0]T;L 2 ) ||?i|| i 2 (0ir; L2). 

We integrate once more with respect to t S [0, T] and obtain 

(3.3) |Mll2( 0iT;L 2) - T||u |Il2 < 2T 1 ||$|| Z ; c ( L 2 iL 2)||/t|| L 2( 0)T . L 2)||u|| L 2(0 i T;L2)- 
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For the first inequality, uq = and u = S a '°(h). By (3.3), 

IIS 0- ' (fr)llx 2 (0,T;L 2 ) ^ 2T ll^ll£c(L 2 ^2)11^11x2(0^2). 

Then, taking t = T in (3.2), we deduce 

Thus, if A^(S a '°(/i)(T)) = ||S a '°(/i)(T)||2 2 = m , then 



T;L2) 



> 



777 



4T||$||2 



l-Cc(L 2 ,L2) 

It follows 

i^(m) = (1/2) inf {\\h\\h(o t-i? J 

JVV ^ W / heL2(0,T;L2):N(S".0(h)(T))=m UI ' ' ' 

>m/(8r||$||2. c(L2)L2) ). 
Now, by the LDP for the mass, we have 

IunelogP(N(n £ '°(T)) > R) < - inf I#(m) 

e^O me[-R,oo] 

and the result follows. 

For the second inequality, uq = \& and u = S a '*(/i). Since H^H^ = 4, (3.3) 
rewrites 

II U IIl2(o,T;L2) - 2 r|| $ IU c (L2 ) L2)||^||i2(0,T;L2)||'"||L2(0,T;L2) ~ 4T < 0. 

Therefore, 

lklU 2 (0,T;L2) < r||$||£ c ( L 2 jL 2)||/i|| L 2(o,T;L2) 



x 1 + 



1 + 



\ T ll^lll(L2,L2)IHIi:2 ( o,T;L2) 

By (3.2) with t = T we deduce that if N{S a '°(h){T)) = ||S a >°(/i)(T)||2 2 < 
4 — R, then 



i?<2r||$||^ {L2)L2) ||h||| 2(0)T . L2) (i + 

We finally obtain 



1 + 



c (L2,L2)II^IIl2(0,T;L2) 



II^IIl2(0,T;L2) 



> 



R 2 



c(L 2 ,L 2 ) 

(4 + i?)' 
The upper bound follows. □ 
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Let us now consider lower bounds. As explained above, our method is to 
find solutions of the control problem with mass at coordinate T satisfying 
constraints and such that the L 2 norm of the control is as small as possible. 
We find these controlled solutions in the form of modulated solitons. We 
have found that it is sufficient that only the amplitude varies. We take the 
solution of (2.6) of the form 

(3.4) V2A(t)exp(-i A 2 (s) ds^j sech(A(t)x). 

This is associated to the control 
*h A (t,x)=i(A'/A)(t)* A (t,x) 

- %y/2A'{t) exp (-ij A 2 (s) ds^J A(t)x (sinh/ cosh 2 ) (A(t)x). 

Unfortunately, the right-hand side is in general not in the range of $ . More- 
over, unless we make artificial assumptions on it is not possible to get 
information on the norm of h. In our result below, we proceed by approxi- 
mation and consider a sequence of operators $ n approximating the identity 
on a sufficiently large set containing the controls. 

Let us assume for the moment that we can consider the space-time white 
noise. Then, the mass of the solution (3.4) at time T is equal to 4A(T) and 
the L 2 norm of the control is given by 

(3.5) IIMlW™ = J(12 + 7T 2 ) [ T( ^^dt. 



The Euler-Lagrange equation associated to the problem of minimizing this 
quantity is 

A" (A^ 2 
2— 



A \A 

Multiplying this identity by A 2 and differentiating, we obtain A'" = 0, so 
that A is a second degree polynomial and it is easy to see that it has to be 
of the form A (t) = a(t - 0) 2 . 

For the problem of the zero initial boundary condition, we have Aq(0) = 
and AAq(T) > R. Hence, we deduce the candidates Aq(£) = R(-^f) 2 for R > R 
arbitrary and R defined as for the upper bounds. Plugging such a function 
into (3.5) gives 

(3-6) ll^ |l! 2{ o,T;L 2 ) = ^(12 + vr 2 )^. 

This would give immediately a lower bound if $ were the identity. Since 
we cannot treat this case, we proceed by approximation. The assumption 
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we make on the covariance involves the following sets of time dependent 
functions. We first introduce, for D C [R,R+ 1], 

Ar>= {A : [0, T) -> R, there exists R £ D such that A(t) = R(t/(2T)f}. 

The functions in A l D are the varying amplitude of the solutions correspond- 
ing to controls in the set 

Cjj = hi € L 2 (0,T; L 2 ), there exists A E A\> 

h(t,x) =i(A /A)(t)^ A (t,x) 

-iV2A'(t)exp(-i J A 2 (s) dsj A(t)rr(sinh/cosh 2 )(A(t)j;)|. 

For the case of a soliton profile as initial data, a similar argument leads us 
to define 

A 2 D = {A : [0, T] -> R, there exists RE D such that 

A(t) = (8 - R - 4 V / 4- J R)(V(2T)) 2 + (-4 + 2^4 - R){t/{2T)) + 1}. 

The set of controls is defined as above by replacing A l D by .A^,. 

We have the following proposition from [22]. The assumptions can eas- 
ily be fulfilled. They are such that the noise is as close as possible to the 
space-time white noise considered in physics that we are not able to treat 
mathematically. 

Proposition 3.2. Let T and R be positive numbers [R in (0,4) for the 
second inequality], take D dense in (R, R+ I) and a sequence of operators 
(<£> n ) ng N in £2(L 2 5 L 2 ) such that for every hECp we have & n h converges to 
h in L 1 (0,T;L 2 ). Then we obtain 

lim elogP(N(u £ '°' n (T)) > R) > -R{12 + ^ 2 )/(18T). 

n— >oo,e^0 

Replacing in the above Cjy by we obtain 

lim elogP(N(u e '*» n (T)) -4< -R) 

> -2(8- J R-4V4^R)(12 + vr 2 )/(36T). 
The exponent n is there to recall that $ is replaced by & n . 

Proof. We only treat the first inequality, the second is similar. Recall 
that by the LDP for the mass, we know that, for a fixed n, 

(3.7) lime log P(N(<u £ '°' n (T)) >R)>- inf I% n (m), 

e^O m>R 
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where 



^N,n( m )— 2 



inf 

heL 2 (0,T;L 2 ):N(S a > - n (fc)(T))=m 



{INIl 2 (0,T;L 2 )} 



where n means that <3? is replaced by $ n in the control equation. 

We take R > R and Ha defined above. Though the stochastic equa- 
tion is not defined when <E> = /, the control map makes sense for any h in 
L 2 (0, T; L 2 ). We denote it by S$£. By classical results (see [6], [10]), S$ N (h) 
is continuous with respect to he C([0, T]; L 2 (]R)) to C([0, T]; L 2 (R)). Thanks 
to our assumptions, we deduce that 



in C([0,T];L 2 (]R)). In particular, since N(S^ N (h Ao )(T)) = R, for n large 
enough, we have N(S a ' 0,n (hA )(T)) > R and the infimum in (3.7) is larger 



We conclude since R can be as close as we want to R. 

Note that the result in Proposition 3.1 depends on $ only through its 
norm as a bounded operator in L 2 . It is not difficult to see that there exists 
sequences of operators ($ n ) ng ^ satisfying the assumptions of Proposition 
3.2, that is, which are Hilbert-Schmidt from L 2 to L 2 and 3> n approxi- 
mates the identity on the good set of controls, and are uniformly bounded 
as operators on L 2 by a constant independent on T. For such sequences of 
operators, the upper and lower bounds given above agree up to constants in 
their behavior in large T. □ 

It is obtained in [19], for the ideal white noise and using the heuristic argu- 
ments recalled in the Introduction, that the probability density function of 
the amplitude of the pulse at coordinate T when the initial datum is zero is 
asymptotically that of an exponential law of parameter eT/2. The amplitude 
is a constant times the mass for the modulated soliton solutions considered 
[19]. Integrating this density over [i?/2,oo) and taking into account the dif- 
ferent normalization, we obtain lim e _ >0 elogP(N(n £ ' (r)) > R) = -R/T. It 
is in between our two bounds and very close to our lower bound. A surpris- 
ing fact is that we obtain our result by parameterizing only the amplitude, 
whereas in [19] a much more general parametrization is used. Both bounds 
exhibit the right behavior in R and T. Moreover, the order in R confirms 
physical and numerical results that the law is not Gaussian. On a log scale 
the order in R is that of tails of an exponential law. In such the 
Gaussian approximation leads to incorrect tails and error estimates. 



than 
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Let us now comment on our results in the case of a soliton profile as 
initial datum. In [19], the error probability when the size of the measure- 
ment window is of the order of the coordinate T is obtained. It is given for 
some constant c(R) by lim e _> elogP(N(?/'*(r)) - 4 < —R) = -c{R)/T. It 
exhibits the same behavior in T as in our calculations. The discussion on 
the behavior with respect to R is less clear. Our bounds are not of the same 
order. In [14, 31] the PDF of the mass at coordinate T for a soliton profile as 
initial datum is not Gaussian. The numerical simulations in [33] relying on 
the ansatz approximation also exhibit a significant difference between the 
log of the tails of the amplitude and that of a Gaussian law. Our lower bound 
indicates that again the tails are thicker than Gaussian tails. Thus, we give 
a rigorous proof of the fact that a Gaussian approximation is incorrect. 

Finally, it is natural to obtain that the tails of the mass are increasing 
functions of T since the higher is T, the less energy is needed to form a 
signal whose mass gets above a fixed threshold at T. Replacing above by 
under, the same holds in the case of a soliton as initial datum. 

Remark 3.3. The H 1 setting is not required here. We could as well 
work with L 2 solutions and a LDP in L 2 . However, it is required to work in 
H 1 for the study of the arrival time below. 

We now estimate the tails of the arrival time. As for the mass, the rate 
is hard to handle since it involves an optimal control problem for controlled 
NLS equations. We again deduce the asymptotic of the tails from the LDP 
looking at upper and lower bounds. We consider that the initial datum is 
~%>\ since only in this case the timing jitter might be troublesome. 

Let us begin with an upper bound. It is deduced from the equation of 
motion of the arrival time in the controlled NLS equation (2.10). 

Proposition 3.4. For every positive T, A and R and every operator 
in £2(L 2 ,£) ; the following inequality holds: 

EmelogP(Y(u e '*Vr)) > R) 

e— >0 



" 8T(2T + lf{AA + R/(2T + l))||<I>||£ e(La , E) ' 

Proof. Differentiating the right hand side of (2.10) with respect to time 
and replacing the time derivative of the solution with the corresponding 
terms of the equation we obtain 

4p(S a> *Vfc)(*)) =49*e / S a '^h)(t,x)(d x ¥h)(t,x)dx. 
dt Jr 
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Indeed, by successive integration by parts, all terms cancel besides the one 
involving the forcing term. Since Y(l^) = and P{^° A ) = 0, thanks to 
(2.10), we obtain the identity 

Y(S°-*a (/*)(*)) =mt( [ f / S a <*A (h) (a, x) (d x $h) {a, x) dx da ds 
\Jo Jo Jr 

rt 



2mt(i f f xS a ^A(h)(s,x)(<Z>h)(s,x)dxds 
\ Jo Jr 



From this identity it follows that the controls h in the minimizing set of the 
LDP applied to the event we consider necessarily satisfy 

^< Y(S^^(^)(^)) < 4r||^||^ c(L2;Hl) ||^|| L2(0 ^ ;L2) ||S a '^(^)|| L2(0 ^ ;L2) 

+ 2 ll^ll£ c (L 2 ,E)ll /l llL 2 (0,T;L 2 )l|S a,,I ' A (/l)||L2(0,T;L 2 )- 

Moreover, arguing as in the proof of Proposition 3.1, 

||S a '*^(/l)|| L 2(o ; T;L 2 ) < ^II$I|£ c (L 2 ,L 2 )INIl 2 (0,T;L 2 ) 



x(l + v /l + 4A/(r||$||2 c(L2jL2) ||/ l ||2 2{0iT;L2)) ). 



A lower bound on (1/2)||/i||£ 2 (q t-l 2 ) f° nows easily since x i— > x(l + \fl + 4/x) 
is increasing on The result follows. □ 

A lower bound is obtained considering controls suggested at the end of 
Section 2 and minimizing on the smaller set of controls. We define the fol- 
lowing set of control for A,T positive and D a subset of (0, oo): 

H% T = j/i G L 2 (0, T; L 2 ),h(t, x) = \{t) (x - 2 J* jf" A(r) dr ds^j V A ,\(t, x), 

with \(t) = 3R(T -t)/(8AT 3 ),ReD\. 

Proposition 3.5. Let T , A and R be positive. Assume that, for a dense 
subset D of [R, R+ 1], (<3? n ) ne N is a sequence of operators m/^2(L 2 ,S) such 
that for any h in a> ^nh converges to h in L 1 (0,T; E). Then we have the 
following inequality where the exponent n is there to recall that $ is replaced 
by 

lim elogP(Y(« £ >*>(T)) >R)> -n 2 R 2 / (128T 3 A 3 ) . 

n— »oo,e— >0 

Proof. We proceed as for Proposition 3.2. By the LDP for the arrival 
time Y, we know that for a fixed n a lower bound is given by 

" in t 7 ?,n(y)» 

y>R ' 
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where 

41(V)=1 {|l^ll^(o,T; L2 )}- 
fcGL 2 (0,T;L 2 ):Y(S a, *A' n (h)(T))=y 

Again, n is there to recall that in the control equation, <3? is replaced by $ n . 
To minorize this quantity, we again first treat the case $ = I and denote by 

^WN * ne control map when $ = /. 

It is not difficult to see that S^^(/t) belongs to L°°([0, T];S) when h 

belong to 1^(0, T;E). Moreover, the norm of S^}(h) in L°°([0, T];S) is 
bounded in terms of the norm of h in L 1 (0,T;S). A standard argument to 
prove this is to compute the second derivative with respect to time of the 

variance V(it) = J R x 2 \u(t,x)\ 2 dx with n = S^^ 1 (/i). Using the argument 
detailed in the proof of the LDP in Section 5 below, this implies that, for 

each t, the mapping h — > S^^ 4 (/i)(t) is continuous from L 1 (0,T; S) to S 1 / 2 . 



Therefore, for h G A , 

Y(S a ^>(h)(T)) = Y(S^($ n / i )(T)) 

(3-8) 

— > Y(S^^(/i)(T)) when n -> oo. 
Proceeding as above, we are thus led to find a control h with minimum 

a 

energy verifying the constraint ~Y(S^^(h)(T)) > R for some R> R. 

We search this control in the set Ht,a defined as H.t,a, but where A is 
not specified and only assumed to belong to L 2 (0,T;R): 

n TA = ! [ h€L 2 (0,T;L 2 ), 

h(t, x) = \(t) (x - 2 j J° A(r) dr ds^j tf^t, x),\€ L 2 (0, T; R) J . 



i) 



We have seen at the end of Section 2 that S^j^(h) = ^a,x f° r h G Ht,a- 
omputa 

? (0,T;L 2 ) 

in: 

h6L»(0,T;L!«):Y(S^(ft) 

ii 

heH T , A msZV(h)(T))>R 



T t 

Also, an easy computation gives Y(&a,\(T)) = 8A J J X(s)dsdt and, for 
h G H t ,a, IHIl2 (0 , T;L2) = (tt 2 /(3^)) Jo T a2 ( s ) We deduce 

inf <Tj0 IHIl 2 (o,T;L 2 ) 

< mf IHIl 2 (0,T;L 2 ) 



inf (tt 2 /(3A)) ! \ 2 {t)dt. 

AeL 2 (0,T;R): / Q T / Q * A(a) ds«>fl/(8A) •'O 
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Note that the constraint Jq Jq A(s) ds dt > R/8A is not a usual boundary 
condition in the calculus of variations. We therefore simply try to find a good 
guess that leads to a lower bound of the the same order in the parameters 
as the upper bound. We use the quantity C T A ^(X) defined by 

X 2 (t)dt-j [ [ X(s)dsdt, 
Jo Jo 

where 7 belongs to K. We then impose that our guess X^ p, is a critical point 

of C T A ^(A) and that it satisfies the constraint Jq J * X(s) ds dt = Rj (8yl). We 
obtain 

y T:Ak (t) = 3R(T-t)/(8AT 3 ). 
We do not claim that the minimization problem is solved, we simply write 

inf (vr 2 /(3^4)) C X 2 {t)dt 

AeL 2 (0,T;R): f^f*\(s)dsdt>R/{8A) JO 

< (tt 2 /(3A)) j\x* T Afi {t)) 2 dt = n 2 R 2 /(64A 3 T 3 ). 

Let us set 

h* R (t, x) = X* T A k {t) [x - 2 j° X* T AR (r) dr ds) (*, x). 

By (3.8), we have, for Re D, 

Y(S a '*>(^)(T))^ Y(S^(^)( T )) when n- oo. 
Therefore, for n large enough, 

Y(S a '*A' n (/i*j)(T)) >R. 

We deduce 

inf J*£(x) < ir 2 R 2 /(64A 3 T 3 ) 

and take the lim in n in the lower bound. Since this is true for R in a dense 
subset of [R, R + l], we deduce the result. □ 

The upper and lower bounds given in Propositions 3.4 and 3.5 are in per- 
fect agreement in their behavior with respect to R and to T when T is large. 
Indeed, as T is large compared to R, the upper bound in Proposition 3.4 is 
of the order of — i? 2 /(128T 3 j4||$||^. n 2 However, we have to be careful 
before doing such a comparison. Indeed, unlike for the mass, it does not seem 
that there exists a sequence of operators ($ n ) nS N satisfying the assumptions 



C T:AR (X) = (tt 2 /(3A)) / 

J 
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of Proposition 3.5 and such that ||3>n||£ c (L 2 ,£) is bounded uniformly in n. 
This explains why the behavior in A in the lower and upper bound seem 
contradictory for large A. We believe, however, that there exists a sequence 
satisfying the assumptions of Proposition 3.5 and such that ||$nlU c (L 2 ,E) i s 
bounded independently with respect to R and T, but not to A. 

It is, however, possible to obtain bounds that match with respect to their 
order in A. Unlike the framework of Proposition 3.5, suppose we consider the 
sequence of operators <&„ = (/- A + \x\ 2 I + i(— A + \x\ 2 I) k )~ 1 / 2 such that 
& n h — > <£/i for any h and <I> = (/ — A + \x\ 2 I)~ 1 ^ 2 . We may prove that for 
sufficiently large k & n are S-valued Hilbert-Schmidt operators. Also then 
ll^n||,c c (L 2 ,E) — ll < ^ > ll£ c (L 2 ,s) = 1 an d, thus, <3? n are bounded uniformly in n. 
We argue as in the above using that 

Y(S#$(<I> n 7£)(T))-> Y(S$$(*h%)(T)) when oo. 

Then for n large enough, the lower bound is given by — H^>IIl 2 (o t-e)/^- 
Then for large A, that is, very localized initial pulse allowing theoretically 
higher transmission rate, the order up to a multiplicative constant is now 
that of the square of the norm of the gradient. It is thus now — R/(T 3 A) 
which matches the order of the upper bound. 

Let us now compare our result with the results obtained in the physics 
literature. First, we note that we obtain that on a log scale the tails are 
equivalent to Gaussian tails. This is indeed the kind of result obtained by 
arguments from the physical theory of perturbation of solitons. We are miss- 
ing the pre exponential factors to conclude whether or not the tails are 
Gaussian. Sharp large deviations could allow to obtain these factors. 

Now, suppose the law were indeed Gaussian, then the asymptotic of the 
tails may be written in terms of the variance. By doing so, we find that 
the variance of the timing jitter is of the order T 3 . It agrees perfectly with 
the initial results of [24]. Also, the order in A — for the lower bound — and T 
seems to agree perfectly with the orders of the contribution of the additive 
noise to the variance of the timing jitter in equation (3.18) in [17]. However 
it is not clear that we wish to obtain the 1/A S order since in [24, 28], where 
the model is instead a juxtaposition of deterministic evolutions with random 
initial data in between amplifiers, the order in A is —c/A. It is what we obtain 
above when we assume for consistency of the assumptions that in the limit 
the noise remains localized. 

We end this section noticing that our result confirms the fact that, in 
the presence of additive noise, the timing jitter is more troublesome than 
the fluctuation of the mass when we consider the problem of losing a sig- 
nal. Indeed, for A = 1, we have found that the tails of the arrival time are 
the order of exp(— ci(i?)/(eT 3 )), while that of the mass are of the order of 
exp(— C2(R)/(sT)), which is clearly negligible compared to the first for large 
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T. In other words the tails of the arrival time are much thicker than that 
of the mass, implying much more frequent large fluctuations of the arrival 
time than of the mass. Error in soliton transmission is much more likely to 
be due to timing jitter rather than decay of the mass of the pulse. Recall 
that T is the length of a fiber optical line and is thus assumed to be very 
large. This result is called the Gordon-Haus effect in the physics literature. 

Remark 3.6. From an engineering point of view, it is possible to expo- 
nentially reduce the probability of undesired deviations of the arrival time 
by introducing inline control elements; see, for example, [19]. We could also 
use ideas given in [36] and optimize on such external fields for a limited 
cost or penalty functional. The new optimal control problem requires then 
double optimization. 

Remark 3.7. Note that the methodology developed herein could prob- 
ably be applied to the determination of the small noise asymptotic of the 
tails of the position of an isolated vortex, defined by § V argu(t,x) ■ dl, in 
Bose condensates or superfluid Helium as in [34] . There the physical pertur- 
bation approach along with the Fokker-Planck equation are used. The small 
noise acts as the small temperature. 

4. Tails of the arrival time in the multiplicative case. In the case of the 
multiplicative noise, the mass is a conserved quantity and we restrict our 
attention to the study of the law of the arrival time of the pulse when the 
initial datum is the soliton profile \&^. 

Again, let us begin with upper bounds. They are obtained from an equa- 
tion for the motion of the arrival time in the controlled NLS equation. 

From relation (2.9) and integration by parts, we obtain the equation in 
[38], 



d 



2 



(4.1) ^Y(S m '*A(h)(t))=2 I \S m >*A(h)(t,x)f(d x $h)(t,x)dx. 
dt l Jr 

We may thus deduce the next proposition. 

Proposition 4.1. For every positive T, A and R and every operator <1> 
in /^2(L 2 ,H S (R,1R)), where s > 3/2 the following inequality holds: 

hm £ logP(Y(u^(T)) > R) <-(^) \ A2T3m f ■ 

Proof. From equation (4.1), the fact that ^Y(S m '^(h))\ t=0 = P(tf^) = 
0, that for such values of s the Sobolev injection of W(R, R) into W 1,0 ° (R, R) 
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is continuous (see [18]), and that the mass is conserved and thus remains 
equal to 4, we obtain 

jY(S m >*A(h)(t)) < 8A||$|| £o(I(a)W i,oc (Dl)H)) ||/i|| Ll( o 1 ^ a) 

< 8AVt||$|| jCc ( L 2 iW i,oo( ffi)R ))||/l|| L 2(o,T;L 2 )- 

Then, since Y(^^) = 0, we obtain integrating the above inequality that 

R< Y(S m >*A(h)(T)) < (16AT 3 / 2 /3)||<I>|| £c(l2 ,w 1 -(r,r))^IIl 2 (o,T;L 2 ) 
and the conclusion follows. □ 

Let us now consider lower bounds. We need to find controls which have 
the desired effect on the arrival time. We have seen that, in the additive case, 
good controls are given by functions in TL^ T . Recalling the transformations 
on the equation made at the end of Section 2, we can equivalently take con- 
trols of the form X(t)x^fA,x which correspond to the solution *$>a.\- Thus, 
in the multiplicative case, a good control is given by h(t,x) = X(t)x. Unfor- 
tunately these controls do not belong to the range of <J> nor to L 2 (0,T;L 2 ) 
and are not admissible. We have not been able to justify the choice of such 
controls by an approximation argument. 

We therefore impose a new assumption that <I> takes its values in H s (R, R) © 
xR. In other words, we consider the slightly different equation 

idu e ' U0 = (Au e ' U0 + \u £ ' U0 \ 2 u e ' U0 ) dt 

(4.2) 

+ u £ ' u ° o yfidW(t) + V^xu £ ' uo o d/3(t), 

where (3 is a standard Brownian motion independent of W and the corre- 
sponding controlled PDE 

ij t S U0 (h l ,h 2 ) = AS U0 (h 1 ,h 2 ) + \S U0 (h u h 2 )\ 2 S U0 (h l ,h 2 ) 

+ S^ihukfiQh! + xS U0 (h u h 2 )h 2 , 

where hi belongs to L 2 (0,T;L 2 ) and h 2 belongs to L 2 (0,T;R), the initial 
datum is uq and in the sequel uq = We may guess by successive ap- 
plications of the Ito formula, multiplying u e,u ° by the random phase term 
exp(zxy / e/3(t)), and similar transformations as in Section 2 (stochastic gauge 
transform, stochastic methods of characteristics, . . .) that we should consider 
the function 
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It indeed satisfies equation (2.3) with the same initial datum. We deduce 
that 

n £ '"° (t, x) = exp (-ix^P(t) + is J* (3 2 (s) ds + 2ie/3(t) J* /3(s) ds* 

x u £ ' uo (t, x-2^J* /9(a) . 
A similar computation shows that 

S u °(hi,h 2 )(t,x) = exp(^-ix^/e J h 2 (s)ds + i j 

+ 2i h 2 (s)ds / / h 2 {u) 
Jo Jo Jo 

\hi)(t,x-2j J*h 2 (u) 



h 2 {u)duj ds 
du ds 



m,uo ( 



x S 



The functions u £,u ° and S u °(hi,h 2 ) are well-defined functions of L 2 (0,T;E) 
and we may compute their arrival times. We obtain a lower bound of the 
asymptotic of the tails of the arrival time of the new solutions. 

Proposition 4.2. For every positive T, A and R and every operator $ 
in C 2 (L 2 , H S (R, R)) where s > 3/2, the following inequality holds: 

lime log F(Y(u £ ^a(T)) > R) > -3R 2 / '(128 A 2 T 3 ). 

e^O 

Proof. Consider the mapping F from C([0, T]; S 1 / 2 ) x C([0,T];R) into 
R such that 



F(u,b) =[ \x\ u(t,x- 2 [ b(s)ds 

Jr \ Jo 



dx. 



Take u and v! in C([0, T]; S 1 / 2 ) and 6 and b' in C([0,T];R), then by the 
triangle and inverse triangle inequalities and the change of variables, we 
obtain 





F«&')l 






</ 


x + 2 [ b(s)ds 
Jo 




r T 

x + 2 b'(s)ds 
Jo 



+ 



[ x + 2 [ b'(s)ds 
Jr Jo 



u(T,x)\ 2 dx 
(\u(T,x)\ 2 -\u'(T,x)\ 2 )dx 



<2 



b(s)ds- / b'(s)ds 



\u(T, x)\ 2 dx 
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+ / \x\\\u(T,x)\-\u'(T,x)\\(\u(T,x)\ + \u'(T,x)\)dx 



+ 2 



T 







b'(s) ds 



\u(T,x)\ — \u'(T,x)\\(\u(T,x)\ + \u'(T,x)\) dx. 



We conclude from the inverse triangle and Holder inequalities that F is 
continuous. We may then push forward by the contraction principle the 
LDP for the paths of u £, ^a and of yfeft by the mapping F using a slight 
modification of the result of exercise 4.2.7 of [9] and obtain a LDP for the 
laws of Y({i £ '*a(T)) which is that of F(u e '*A 5 y/s/3) of speed e and good rate 
function defined as a function of the rate function of the original solutions 
and of the rate function I@ of the sample path LDP for y/ef3: 

Iy A (x)= inf (I^(u)+I p (b)) 

(u,bj:F(u,b)=x 

^\ mf {II^iIIl^O.T^ 2 ) + IIMl 2 (0,T;R)} 

(hito):F(S m ' A(h 1 ),J h 2 (s)ds)=x 



<\ inf {"' 

(fe ll fta):Y(S*A(h 1 ,ft 2 )(T))=a: 



l^l|lL 2 (0,T;L 2 ) + II^2|Il 2 (0,T;R)}- 



Thus, considering solely controls of the from (0, /12), we minimize in h 2 for 
7 in R, 

T rT ft 

hl(t)dt-j / / h 2 (s)ds, 
Jo Jo 

where we impose that 

Y(* Ah2 (T)) = 8A [ T f h 2 (s)ds = R>R. 
Jo Jo 

The conclusion follows. □ 

Remark 4.3. We may check that Y(u £ '*a) = Y(t2 e '*A) -8^j^(3(s) ds 
and that J Q T @(s) ds is a centered Gaussian random variable with variance 
T 3 /3. 

The corresponding upper bound for this modified stochastic NLS equation 

is 

TmTelogP(Y(ti £ '* 4(r)) > R) < -(3/16) 2 — 



~° U/ " U ^ 2r3 (II^HL(L 2 ,W^(R,R)) V1 )' 

Note that the lower bound does not require to consider a sequence of op- 
erators (<J>n) n eN and we may indeed compare the upper and lower bounds. 
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They are of the same order in T and in A. Note also that, as in the ad- 
ditive case, we obtain that on a log scale the tails are equivalently that of 
Gaussian tails. Also, our tails are of the order in T that we expect from the 
contribution of the multiplicative noise to the variance of the timing jitter 
in equation (3.18) in [17]. 

However, concerning the amplitude, it is not of the order of —c/A 4 as 
we would expect from [17]. This is probably due to the fact that we have 
considered a colored noise with a term x that grows linearly in time (the 
x variable). 

5. Proof of Theorem 2.1. We herein denote the variance of / in E by 
V(f)=J R \x\ 2 \fW\ 2 dx. 

Let us start with the additive case. We denote by v uo (z) the solution of 

!i— = Av + \\v - iz\ 2a (v - iz), 
at 
u(o) = u e£, 

where z belongs to X^ 2(7+2 ^ = C([0, T]; E) n L r (0, T; W 1,2<7+2 ) and r is 
such that 2/r = 1/2 — l/(2a + 2). We also denote by Q uo the mapping 

z i — ^ v u °(z) - iz. 

Note that u £ ' u ° = Q u ° (y/eZ), where Z is the stochastic convolution defined 
by Z(t)= $U{t- s)dW(s). 

We can check from similar arguments (as those of the proof of Proposition 
1 in [22]) that the stochastic convolution is a X^ T ' 2a+2 '^ random variable 
whose law fi z is a centered Gaussian measure. Let z belong to X i - T,2a+2 ^\ 
take s <t <T, the triangle along with the Holder inequalities, then compute 

/ \x\(\g u "(z)(t,x)\ 2 -\g uo (z)( s ,x)\ 2 )dx 

JR 

JR 

< \\G U0 (z)(t) - g u °(z)( s )\\ L ^v(\g^(z)(t)\ + \g^(z)( s )\) 
<2V2\\g u °(z)(t)-g u °(z)( s )\\ L2 

x (y/v(v^(z)(t)) + y/v(v»°(z)(s)) + y/v(z(t)) + y/v(z(s))). 

The application of the Gronwall inequality in the proof of Proposition 3.5 
in [11], along with the Sobolev injection allow to prove that g u °(z) belongs 
to C([0,T]; E 1 / 2 ). The computation above also shows that g u ° is continuous 
from X( T > 2<T+2 > S ) to C([0,T];E 1 /2). The general result on LDP for Gaus- 
sian measures gives the LDP for the measures \i Ze , the direct images of \i z 
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under the transformation x \— > \fex on 

x (t,2<t+2,e)_ We conc i uc i e w i tn tne 

contraction principle. 

In the multiplicative case, it is also required to revisit the proof of the 
LDP in [23]. As mentioned in Section 2, we only emphasize the adaptations 
of the proof of [23] required to state a LDP in C([0, T); E 1 / 2 ). Note that in 
the following <E>/i is replaced by §{-, where / belongs to Hj(0,T;H s (R,R)) 
which is the subspace of C([0, T]; H S (R, R)) of functions zero at time 0, square 
integrable in time and with square integrable in time derivative. The control 
map is then denoted by S m ' u °(f). 

We may check using the above calculation and the fact that, for every 
t e [0,T], S m > u °(f)(t) belongs to S that 

V(S^(/)(t)) < (4||S m ^(/)(t)|| 2 ([0>T];Hl) + V(n ))e r ; 

(see the arguments of the proof of Proposition 3.2 in [12]) used for the control 
map, that the control map is continuous from the sets of levels of the rate 
function of the Wiener process less or equal to a positive constant, with the 
topology induced by that of C([0,T];H s (R,R)), to C([0, T]; X 1 / 2 ). The only 
difference in the proof of Proposition 4.1 in [23], the Azencott lemma (also 
called the Freidlin-Wentzell inequality or almost continuity of the ltd map) 
is in step 2. It is the reduction to estimates on the stochastic convolution. 
We use 

V(«**»(i)) < (4||^(t)|| 2 c([0)T] .Hi) + V(n ))e T 

(see the proof of Proposition 3.2 in [12]), where v £ ' u ° satisfies v £ ' Uo (0) = uq 
and 

idv 5 ™ = ^Au e,fi ° + A|t/' So | 2 <V'"° + ^v £ ' a ° - (ie/2)F$v £ ^ dt 

+ ^V £M dW £ , 

/(•) = f $h(s)ds, W £ (t) = W(t) - = W(t) - (1/Vi) x 

J*$h(s)ds, F< 5> (x)=Y,j°=i($e j (x)) 2 and (e,)f = ± is any complete orthonor- 
mal system of L 2 . The bound remains the same as in [12] because of the can- 
celation of the extra term in the application of the ltd formula and the can- 
celation of the Ito-Stratonovich correction with the second order ltd correc- 
tion term when the ltd formula is applied to the truncated variance V r (v) = 
J R exp(— r|x| 2 )|x| 2 |-y(x)| 2 dx. 
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